Smart Paper Notes

FuRPE: Learning Full-body Reconstruction from Part Experts

Zhaoxin Fan , Yuqing Pan , Hao Xu , Zhenbo Song , Zhicheng Wang , Kejian Wu , Hongyan Liu , Jun He

Category: Computer Vision

2022-11-30

Full-body reconstruction is a fundamental but challenging task. Owing to the lack of annotated data, the performance of existing methods is largely limited. In this paper, we propose a novel method named Full-body Reconstruction from Part Experts (FuRPE) to tackle this issue. In FuRPE, the network is trained using pseudo labels and features generated from part experts. A simple yet effective pseudo ground-truth selection scheme is proposed to extract high-quality pseudo labels. In this way, a large number of existing human body reconstruction datasets can be leveraged to contribute to model training. In addition, an exponential moving average training strategy is introduced to train the network in a self-supervised manner, further boosting the performance of the model. Extensive experiments on several widely used datasets demonstrate the effectiveness of our method over the baseline. Our method achieves state-of-the-art performance. Code will be publicly available for further research.
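
The exponential moving average (EMA) strategy mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not specify how FuRPE applies EMA, so the teacher/student naming, the flat list of weights, and the `decay` value below are assumptions for illustration only.

```python
def ema_update(teacher, student, decay=0.99):
    """Blend student weights into teacher weights.

    Each new teacher weight is decay * teacher + (1 - decay) * student.
    `teacher` and `student` are plain lists of floats (hypothetical stand-ins
    for network parameters).
    """
    if len(teacher) != len(student):
        raise ValueError("teacher and student must have the same length")
    return [decay * t + (1.0 - decay) * s for t, s in zip(teacher, student)]


# Toy usage: one EMA step moves the teacher slightly toward the student.
teacher = [0.0, 1.0]
student = [1.0, 3.0]
teacher = ema_update(teacher, student, decay=0.9)
```

A decay close to 1 makes the teacher change slowly, which is what usually makes an EMA copy a stable target for self-supervised training.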

Translated by Google Translate

An Analysis of the Differences Among Regional Varieties of Chinese in Malay Archipelago

Nankai Lin , Sihui Fu , Hongyan Wu , Shengyi Jiang

Category: Natural Language Processing

2022-09-10

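Chinese features prominently in the Chinese communities of the nations of the Malay Archipelago. In these countries, Chinese has undergone a process of adjustment to the local languages and cultures, which has led to the emergence of a Chinese variant in each country. In this paper, we conduct a quantitative analysis of Chinese news texts collected from five Malay Archipelago nations. Statistical results show that the Chinese variants used in these five nations differ from their modern mainland Chinese counterpart. Meanwhile, we extract and classify several Chinese words used in each nation. All these differences reflect how Chinese has evolved overseas and demonstrate the profound impact of local societies and cultures on the development of Chinese.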

Efficient Adaptive Federated Optimization of Federated Learning for IoT

Zunming Chen , Hongyan Cui , Ensen Wu , Yu Xi

Category: Machine Learning | Artificial Intelligence

2022-06-23

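The proliferation of the Internet of Things (IoT) and the widespread use of devices with sensing, computing, and communication capabilities have motivated intelligent applications empowered by artificial intelligence. Classical artificial intelligence algorithms require centralized data collection and processing, which are challenging in realistic intelligent IoT applications due to growing data privacy concerns and distributed datasets. Federated Learning (FL) has emerged as a distributed privacy-preserving learning framework that enables IoT devices to train a global model by sharing model parameters. However, the inefficiency caused by frequent parameter transmission significantly reduces FL performance. Existing acceleration algorithms fall into two main types: local update, which considers the trade-off between communication and computation, and parameter compression, which considers the trade-off between communication and precision. Jointly considering these two trade-offs and adaptively balancing their impact on convergence remains unresolved. To solve this problem, this paper proposes a novel Efficient Adaptive Federated Optimization (EAFO) algorithm to improve the efficiency of FL. EAFO minimizes the learning error by jointly considering two variables, local update and parameter compression, and enables FL to adaptively adjust the two variables and balance the trade-offs among computation, communication, and precision. Experimental results show that, compared with state-of-the-art algorithms, the proposed EAFO achieves higher accuracy faster.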

Object Level Depth Reconstruction for Category Level 6D Object Pose Estimation From Monocular RGB Image

Zhaoxin Fan , Zhenbo Song , Jian Xu , Zhicheng Wang , Kejian Wu , Hongyan Liu , Jun He

Category: Computer Vision

2022-04-04

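Recently, RGBD-based category-level 6D object pose estimation has achieved promising performance improvements; however, the requirement for depth information prohibits broader applications. To alleviate this problem, this paper proposes a novel approach named Object Level Depth reconstruction Network (OLD-Net), which takes only RGB images as input for category-level 6D object pose estimation. We propose to predict object-level depth directly from a monocular RGB image by deforming the category-level shape prior into object-level depth and the canonical NOCS representation. Two modules, named Normalized Global Position Hints (NGPH) and Shape-aware Decoupled Depth Reconstruction (SDDR), are introduced to learn high-fidelity object-level depth and delicate shape representations. Finally, the 6D object pose is solved by aligning the predicted canonical representation with the back-projected object-level depth. Extensive experiments on the challenging CAMERA25 and REAL275 datasets show that our model, though simple, achieves state-of-the-art performance.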

ACR-Pose: Adversarial Canonical Representation Reconstruction Network for Category Level 6D Object Pose Estimation

Zhaoxin Fan , Zhengbo Song , Jian Xu , Zhicheng Wang , Kejian Wu , Hongyan Liu , Jun He

Category: Computer Vision | Artificial Intelligence

2021-11-20

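Recently, category-level 6D object pose estimation has achieved significant improvements with the development of canonical 3D representation reconstruction. However, the reconstruction quality of existing methods is still far from excellent. In this paper, we propose a novel Adversarial Canonical Representation Reconstruction Network named ACR-Pose. ACR-Pose consists of a Reconstructor and a Discriminator. The Reconstructor is primarily composed of two novel sub-modules: the Pose-Irrelevant Module (PIM) and the Relational Reconstruction Module (RRM). PIM tends to learn canonical-related features, making the Reconstructor insensitive to rotation and translation, while RRM explores essential relational information between different input modalities to generate high-quality features. Subsequently, a Discriminator is employed to guide the Reconstructor to generate realistic canonical representations. The Reconstructor and the Discriminator learn to optimize through adversarial training. Experimental results on the prevalent NOCS-CAMERA and NOCS-REAL datasets demonstrate that our method achieves state-of-the-art performance.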

SHLE: Devices Tracking and Depth Filtering for Stereo-based Height Limit Estimation

Zhaoxin Fan , Kaixing Yang , Min Zhang , Zhenbo Song , Hongyan Liu , Jun He

Category: Computer Vision

2022-12-22

Recently, over-height vehicle strikes have occurred frequently, causing great economic cost and serious safety problems. Hence, modern large and medium-sized cars, such as touring cars, need an alert system that can accurately discover any height limiting devices in advance. Detecting the height limiting devices and estimating their height limits is the key to a successful height limit alert system. Though some works study height limit estimation, existing methods are either too computationally expensive or not accurate enough. In this paper, we propose a novel stereo-based pipeline named SHLE for height limit estimation. Our SHLE pipeline consists of two stages. In stage 1, a novel device detection and tracking scheme is introduced, which accurately locates the height limit devices in the left or right image. Then, in stage 2, the depth is temporally measured, extracted, and filtered to calculate the height limit of the device. To benchmark the height limit estimation task, we build a large-scale dataset named "Disparity Height", which provides stereo images, pre-computed disparities, and ground-truth height limit annotations. We conducted extensive experiments on "Disparity Height", and the results show that SHLE achieves an average error below 10 cm even when the car is 70 m away from the devices. Our method also outperforms all compared baselines and achieves state-of-the-art performance. Code is available at https://github.com/Yang-Kaixing/SHLE.
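
As a rough illustration of the stage 2 idea (depth measured over time, then filtered), the sketch below converts stereo disparities to depth with the standard pinhole relation depth = focal length × baseline / disparity and takes a median over frames. The median filter, the function names, and all numeric values are assumptions; the abstract does not describe the filtering that SHLE actually uses.

```python
from statistics import median


def disparity_to_depth(disparity_px, focal_px, baseline_m):
    """Standard rectified-stereo relation: depth = f * B / d."""
    if disparity_px <= 0:
        raise ValueError("disparity must be positive")
    return focal_px * baseline_m / disparity_px


def filtered_depth(disparities_px, focal_px, baseline_m):
    """Median depth over frames, skipping invalid (non-positive) disparities.

    The median is an assumed, simple outlier-resistant filter.
    """
    depths = [
        disparity_to_depth(d, focal_px, baseline_m)
        for d in disparities_px
        if d > 0
    ]
    if not depths:
        raise ValueError("no valid disparity measurements")
    return median(depths)


# Hypothetical per-frame disparities (pixels) for one tracked device;
# 0.0 is an invalid match and 30.0 is an outlier.
depth = filtered_depth([10.0, 10.5, 0.0, 9.5, 30.0], focal_px=1000.0, baseline_m=0.5)
```

Because depth is inversely proportional to disparity, small disparity errors at long range produce large depth errors, which is why some form of temporal filtering matters for distant devices.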

Towards Efficient and Domain-Agnostic Evasion Attack with High-dimensional Categorical Inputs

Hongyan Bao , Yufei Han , Yujun Zhou , Xin Gao , Xiangliang Zhang

Category: Machine Learning | Artificial Intelligence

2022-12-13

Our work targets searching for feasible adversarial perturbations to attack a classifier with high-dimensional categorical inputs in a domain-agnostic setting. This is intrinsically an NP-hard knapsack problem, where the exploration space grows explosively as the feature dimension increases. Without the help of domain knowledge, solving this problem with heuristic methods, such as Branch-and-Bound, suffers from exponential complexity, yet can produce arbitrarily bad attack results. We address the challenge through the lens of multi-armed bandit based combinatorial search. Our proposed method, namely FEAT, treats modifying each categorical feature as pulling an arm in multi-armed bandit programming. Our objective is to achieve a highly efficient and effective attack using an Orthogonal Matching Pursuit (OMP)-enhanced Upper Confidence Bound (UCB) exploration strategy. Our theoretical analysis, which bounds the regret gap of FEAT, guarantees its practical attack performance. In the empirical analysis, we compare FEAT with other state-of-the-art domain-agnostic attack methods on various real-world categorical data sets from different applications. Substantial experimental observations confirm the expected efficiency and attack effectiveness of FEAT in different application scenarios. Our work further hints at the applicability of FEAT for assessing the adversarial vulnerability of classification systems with high-dimensional categorical inputs.
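
For readers unfamiliar with Upper Confidence Bound exploration, the sketch below shows plain UCB1 arm selection, where each arm could stand for one categorical feature to modify. This is the textbook rule, not the OMP-enhanced strategy of FEAT; the function name, the reward bookkeeping, and the exploration constant `c` are assumptions for illustration.

```python
import math


def ucb1_select(counts, rewards, t, c=2.0):
    """Pick an arm by the UCB1 rule.

    counts[a]  : number of times arm a has been pulled
    rewards[a] : sum of rewards observed for arm a
    t          : total number of pulls so far (t >= 1)
    Untried arms are pulled first; otherwise the arm with the highest
    mean reward plus exploration bonus sqrt(c * ln(t) / counts[a]) wins.
    """
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    scores = [
        rewards[a] / counts[a] + math.sqrt(c * math.log(t) / counts[a])
        for a in range(len(counts))
    ]
    return max(range(len(counts)), key=scores.__getitem__)


# Toy usage: arm 1 has been tried only once, so its bonus dominates.
choice = ucb1_select(counts=[100, 1], rewards=[60.0, 0.5], t=101)
```

The exploration bonus shrinks as an arm is pulled more often, so the rule gradually shifts from trying every arm to exploiting the best one.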

A model-data asymptotic-preserving neural network method based on micro-macro decomposition for gray radiative transfer equations

Hongyan Li , Song Jiang , Wenjun Sun , Liwei Xu , Guanyu Zhou

Category: Machine Learning

2022-12-11

We propose a model-data asymptotic-preserving neural network (MD-APNN) method to solve the nonlinear gray radiative transfer equations (GRTEs). The system is challenging to simulate with both traditional numerical schemes and vanilla physics-informed neural networks (PINNs) due to its multiscale characteristics. Under the framework of PINNs, we employ a micro-macro decomposition technique to construct a new asymptotic-preserving (AP) loss function, which includes the residual of the governing equations in the micro-macro coupled form, the initial and boundary conditions with additional diffusion limit information, the conservation laws, and a few labeled data. A convergence analysis is performed for the proposed method, and a number of numerical examples are presented to illustrate the efficiency of MD-APNNs and, in particular, the importance of the AP property in neural networks for diffusion-dominated problems. The numerical results indicate that MD-APNNs lead to better performance than APNNs or pure data-driven networks in the simulation of the nonlinear non-stationary GRTEs.

Detection of Strongly Lensed Arcs in Galaxy Clusters with Transformers

Peng Jia , Ruiqi Sun , Nan Li , Yu Song , Runyu Ning , Hongyan Wei , Rui Luo

Category: Computer Vision

2022-11-11

Strong lensing in galaxy clusters probes properties of dense cores of dark matter halos in mass, studies the distant universe at flux levels and spatial resolutions otherwise unavailable, and constrains cosmological models independently. The next-generation large-scale sky imaging surveys are expected to discover thousands of cluster-scale strong lenses, which would lead to unprecedented opportunities for applying cluster-scale strong lenses to solve astrophysical and cosmological problems. However, the large dataset challenges astronomers to identify and extract strong lensing signals, particularly strongly lensed arcs, because of their complexity and variety. Hence, we propose a framework to detect cluster-scale strongly lensed arcs, which contains a transformer-based detection algorithm and an image simulation algorithm. We embed prior information about cluster-scale strongly lensed arcs into the training data through simulation and then train the detection algorithm with simulated images. We use the trained transformer to detect strongly lensed arcs in simulated and real data. Results show that our approach achieves an accuracy of 99.63%, a recall of 90.32%, a precision of 85.37%, and a false positive rate of 0.23% in detecting strongly lensed arcs in simulated images, and detects almost all strongly lensed arcs in real observation images. Besides, using an interpretation method, we show that our method can identify important information embedded in simulated data. As a next step, to test the reliability and usability of our approach, we will apply it to available observations (e.g., DESI Legacy Imaging Surveys) and to simulated data of upcoming large-scale sky surveys, such as Euclid and the CSST.
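
The four rates reported in this abstract are related through the confusion-matrix counts. The sketch below uses the standard textbook definitions; the abstract does not state exactly how the authors count detections, and the counts in the usage example are hypothetical, not data from the paper.

```python
def detection_metrics(tp, fp, tn, fn):
    """Standard rates from confusion-matrix counts.

    tp/fp/tn/fn are true positives, false positives, true negatives and
    false negatives. Assumes every denominator is non-zero.
    """
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "recall": tp / (tp + fn),
        "precision": tp / (tp + fp),
        "false_positive_rate": fp / (fp + tn),
    }


# Hypothetical counts for illustration only.
metrics = detection_metrics(tp=8, fp=2, tn=88, fn=2)
```

When negatives vastly outnumber positives, accuracy and the false positive rate can both look excellent while precision stays noticeably lower, since they are computed over different denominators.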

GIDP: Learning a Good Initialization and Inducing Descriptor Post-enhancing for Large-scale Place Recognition

Zhaoxin Fan , Zhenbo Song , Hongyan Liu , Jun He

Category: Computer Vision

2022-09-23

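Large-scale place recognition is a fundamental but challenging task that plays an increasingly important role in autonomous driving and robotics. Existing methods have achieved acceptably good performance; however, most of them concentrate on designing elaborate global descriptor learning network structures. The importance of feature generalization and descriptor post-enhancing has long been neglected. In this work, we propose a novel method named GIDP to learn a Good Initialization and Inducing Descriptor Post-enhancing for large-scale place recognition. In particular, GIDP introduces an unsupervised momentum contrast point cloud pretraining module and a reranking-based descriptor post-enhancing module. The former aims to learn a good initialization for the point cloud encoding network before the place recognition model is trained, while the latter aims to post-enhance the predicted global descriptor through reranking at inference time. Extensive experiments on both indoor and outdoor datasets demonstrate that our method achieves state-of-the-art performance using simple and general point cloud encoding backbones.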
